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Abstract 



Inclusive b-quark production in two-photon collisions has been measured at 
LEP using an integrated luminosity of 698 pb" 1 collected by the ALEPH 
detector with yfs between 130 and 209 GeV. The b quarks were identified 
using lifetime information. The cross section is found to be 

a(eV^eVbbX) = (5.4 ± 0.8 stat ± 0.8 syst ) pb, 

which is consistent with Next-to-Leading Order QCD. 
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1 Introduction 



The cross section for heavy flavour production in two-photon interactions is expected to 
be reliably calculated in perturbative QCD, particularly in the case of b-quark production, 
as the heavy quark mass introduces a relatively large scale into the process. The cross 
section has been calculated in Next-to-Leading Order (NLO) QCD to be between 2.1 and 
4.5 pb [1], which is two orders of magnitude smaller than that for charm production, 
which in turn is approximately 6% of the total cross section for hadron production. The 
latter is dominated by soft processes involving u, d and s quarks. The process of heavy 
flavour production in two-photon interactions at LEP energies is dominated by the two 
classes of diagrams shown in Fig. 1. These are referred to as the 'direct' process in which 
the photon couples directly to the heavy quark, and the 'single resolved' process in which 
one photon first fluctuates into quarks and gluons. This separation is unambiguous up 
to next-to-leading order due to the heavy quark mass [2]. In the resolved diagram the 
dominant process is photon-gluon fusion where a gluon from the resolved photon couples 
with the heavy quark. Heavy quark production via double resolved processes is highly 
suppressed at LEP energies [1]. 

The only measurement of b-quark production in two-photon collisions published to 
date is by the L3 Collaboration, obtained from a fit to the transverse momentum of 
leptons with respect to jets [3]: the cross section was measured to be about three times 
the prediction of NLO QCD. Similar results have been reported at conferences by OPAL [4] 
and DELPHI [5]. 

This paper presents a measurement of open b-quark production in data collected 
between 1996 and 2000 with an integrated luminosity of 698 pb -1 . During this period 
the LEP centre of mass energy ranged from 130 to 209 GeV, with a mean of 196 GeV. 
The result is the first published measurement in which lifetime information has been used 
to identify heavy flavour quarks in two-photon physics. The paper is organised as follows. 
Section 2 gives a brief description of the ALEPH detector, Section 3 presents the event 
generators used for the simulation of the signal and backgrounds, Section 4 describes the 
jet finding procedure employed, and Section 5 describes the b tagging procedure. The 
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Figure 1: Diagrams contributing to b-quark production in 77 collisions. 
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initial event selection based on cuts is described in Section 6, followed in Section 7 by 
the final selection which uses an event weighting procedure. In Section 8 the efficiency 
calculation is described, with the resulting cross Section given in Section 9. In Section 10 
the calculation of the systematic uncertainties is described, and in Section 11a number 
of cross checks are presented. Finally in Section 12 the final value for the cross section of 
open b-quark production is shown. 

2 ALEPH Detector 

The ALEPH detector has been described in detail elsewhere [6]. Critical to this analysis 
is the ability to accurately measure charged particles. These are detected in a large time 
projection chamber (TPC) supplemented by information from the inner tracking chamber 
(ITC) which is a cylindrical drift chamber sitting inside the TPC, and a two-layer silicon 
strip vertex detector (VDET) which surrounds the beam pipe close to the interaction 
point. The VDET was upgraded in 1996 for the high energy running of LEP. It consists 
of 48 modules of double sided silicon strip detectors arranged in two concentric cylinders. 
The resolution in r<p is 10 /zm, while that in z rises from 15 jum for tracks perpendicular 
to the beam direction to 50 fim for tracks at cos# = 0.85 [7]. Charged particle transverse 
momenta are measured with a resolution of Spt/pt = 6 x 10~ 4 p t © 0.005 (p t in GeV/c). 

Outside the TPC lies the electromagnetic calorimeter (ECAL) whose primary purpose 
is the identification and measurement of electromagnetic clusters produced by photons and 
electrons. It is a lead/proportional-tube sampling calorimeter segmented in 0.9° x 0.9° 
projective towers read out in three sections in depth. It has a total thickness of 22 
radiation lengths and a relative energy resolution of 0.18/v^E © 0.009, (E in GeV) for 
photons. Outside the ECAL, a superconducting solenoidal coil produces a 1.5 T axial 
magnetic field and the iron return yoke for the magnet is instrumented with 23 layers of 
streamer tubes to form the hadron calorimeter (HCAL). The HCAL has a relative energy 
resolution for hadrons of 0.85/ \fE (E in GeV). The outermost detector of ALEPH is a 
set of muon chambers which consist of two double-layers of streamer tubes. Near the 
beam pipe, 3 m from the interaction point on either side, are two luminosity calorimeters, 
the LCAL and SiCAL, which are electromagnetic calorimeters specifically designed to 
measure the luminosity via Bhabha scattering. 

The information from the tracking detectors and the calorimeters are combined in an 
energy flow algorithm [6]. For each event, the algorithm provides a set of charged and 
neutral reconstructed particles, called energy-flow objects. 

3 Monte Carlo Simulation 

The PYTHIA [8] Monte Carlo program was used to simulate the two-photon processes. 
The production of b and c quarks by the direct and resolved process was modelled 
separately using PYTHIA 6.1 with matrix elements including mass effects. For the 
resolved process the photon's parton distribution function was the PYTHIA default (SaS 
ID) [9]- 

The charm quark production cross section was normalised using the average of the 
measurements made at LEP2, a(e + e~ — > e + e~ccX) = 930 ± 120 pb [3, 10]. All remaining 
hadron production by two-photon collisions was simulated using the standard PYTHIA 
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machinery for incoming photon beams [11]. The result of this paper will be compared to 
a calculation which is valid for real photons (Q 2 ~ 0) so events with Q 2 > 6 were treated 
as a background and will be referred to as 7*7 events for the remainder of this paper. The 
background from e + e~ — > qq was produced using the KK Monte Carlo program [12] . 

4 Jet Finding 

The direction of partons in an event was estimated using jets found with a dedicated 
jet finder (PTCLUS) that optimises the reconstruction of resolved events. The PTCLUS 
algorithm consists of three steps. 

• The most energetic energy flow object is taken as the first jet initiator. The 
algorithm then loops through all the remaining objects in order of decreasing energy. 
If the angle between an object's momentum vector p and the jet momentum pj e ^ is 
less than 90° and the transverse momentum of the object with respect to p + Pj e ^ is 
smaller than 0.5 GeV/c then, the object is added to the jet. Otherwise the object 
is used as a new jet initiator. The procedure is repeated until all objects have been 
assigned to a jet. 

• The distance between two jets is defined as Y = M 2 /E^ is where M is the invariant 
mass of the pair of jets, assumed to be massless, and E vis is the visible energy. The 
pair of jets with the smallest value of Y is merged provided Y < 0.1 and they are 
within 90° of each other. 

• The process of merging jets may result in objects having a larger transverse 
momentum with respect to the jet to which they have been assigned than to another 
jet. If this is the case the object is reassigned to the other jet. A maximum of five 
reassignments may occur after each merger. 

The last two steps are repeated until no pair of jets has Y < 0.1. 

5 b Tagging 

This analysis relies on the ALEPH b tagging software developed to identify b quarks via 
their long lifetimes [13]. It identifies charged tracks that appear to originate from a point 
away from the primary event vertex, and along the direction of the reconstructed b quark. 
The b tagging algorithm relies on the impact parameter of charged tracks to indicate the 
presence of long lived particles. The impact parameter is defined as the distance of closest 
approach in space between a track and the main vertex in the event. It is signed positive 
(negative) if the point of closest approach between the track and the estimated b hadron 
flight path is in front of (behind) the main vertex, along the direction of the b momentum 
estimated using the jets found by PTCLUS. The impact parameter significance S is 
defined as the signed impact parameter divided by its estimated measurement error. A 
fit to the negative S distribution is used to derive a function which when applied to a 
single track can be used to obtain P t rack, the probability that a track originated at the 
main event vertex. Only tracks which are likely to have S reliably measured are used, 
in particular they are required to have at least one associated VDET hit. The primary 
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vertex in an event is found using a procedure specifically designed for use in b tagging. 
Probabilities from tracks with positive S are combined to form tagging variables. Three 
tagging variables are used in this analysis. These are invent, -Pjeti, and Pj Ct2 which are 
respectively the probabilities that the whole event, the first jet or the second jet contained 
no decay products from long lived particles. 



6 Event Selection 

The preselection stage of the analysis identified events which were predominantly from 
low two-photon interactions. Events were required to have 

• at least 5 charged tracks; 

• invariant mass of all energy flow objects (W v i S ) between 4 and 40 GeV/c 2 ; 

• total energy in the luminosity calorimeters SiCAL and LCAL less than 30 GeV; 

• total transverse momentum of the event, relative to the beam direction, less than 6 
GeV/c; 

• thrust less than 0.97. 

The PTCLUS algorithm was used to find jets using all energy flow objects with | cos#| 
less than 0.94. This cut results in the b quark jets having similar properties in direct and 
resolved events. Between 1 and 3 jets were found and ranked by how close their mass was 
to the nominal b quark mass of 5 GeV/c 2 , with Jet 1 being the closest, Jet 2 the next 
closest, etc. After the preselection approximately 80% of the Jet 1 sample were within 
15° of a parton in the direct 77 — > bb Monte Carlo, while the corresponding figure for the 
resolved Monte Carlo was 70%. 



ALEPH ALEPH 




W vis (GeV/c 2 ) E jet1 (GeV) 

Figure 2: Distribution of W vis and the energy of Jet 1 in data and simulation after 
preselection. 
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From the distribution of W v[s , and the energy of Jet 1 shown in Fig. 2 it can be seen 
that the preselected sample is dominated by events containing light quarks. 

A further selection was applied to enhance the fraction of events from the signal 
process, 77 — > bb X. Events were required to have 

• at least 7 charged tracks; 

• invariant mass of all energy flow objects between 8 and 40 GeV/c 2 ; 

• at least two jets; 

• P cvcnt < 0.05; 

• the third largest impact parameter significance S greater than 0.0; 

• the fourth largest impact parameter significance S greater than -10. 

Figure 3 shows the distribution of W vis , and the energy of Jet 1 for events at this stage 
of the analysis. Comparison with Fig. 2 shows that while the proportion of the events 
in this sample originating from b quarks has increased compared to the preselection, the 
dominant source of events is still 77 — > uds and 77 — > ccX. 
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Figure 3: Distribution of W vis and the energy of Jet 1 in data and simulation after 
selection. 
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7 Iterative Discriminant Analysis 



In this analysis the likelihood that an event belongs either to the signal or to the 
background is determined by means of an Iterative Discriminant Analysis (IDA) [14]. The 
details of the method are described in the Appendix. The method generalises standard 
linear discriminant analysis and proceeds through a series of iterations. At each iteration 
% events are selected by applying a cut on the discriminant function for that iteration 
(Di) and a new discriminant function is then generated for the remaining events. The 
simulated samples described in section 3 were used to determine the IDA coefficients. A 
set of 11 variables was chosen as input to the IDA process, these were: 

m P p p 

w 1 event;- 4 jetl;- 4 jetzi 

• mass and transverse momentum of Jet 1; 

• the five highest track impact parameter significances S seen in the event; 

• the thrust of the event. 

After each IDA iteration the simulations of signal and background were used to choose 
whether to perform another iteration, and where to place the cut on D^. A series of 
possible values at which to apply a selection on were chosen starting with one that 
selects 100% of the signal and increasing in steps of 1% until no signal remained. At 
each step the significance of the expected signal above the cut was calculated by dividing 
it by the predicted error for the integrated luminosity in the data, including estimated 
statistical and systematic uncertainties. Having determined the value of Di at which the 
significance was maximal, the cut to be applied to the discriminant variable Di was set 
at a value A D lower. The value of A D was set to 1.5 for the first iteration, and halved at 
each subsequent iteration. This continued for three iterations after which there was no 
further improvement in the predicted significance. 

The coefficients of the discriminant analysis and and cut values derived from this 
procedure were then applied to the data. However in order to perform various systematic 
checks which will be described later, it proved necessary to loosen the cut on D 2 . This 
had no significant impact on the purity of the signal obtained. The final cut on _D 3 was 
chosen to maximise the size of the signal relative to its uncertainty (both statistical and 
systematic). Table 1 shows the fraction of the total event sample estimated to come from 
various sources and the number of events in the data, at different stages in the analysis. 
The distribution of the discriminant variables Di in the data and simulation is shown in 
Fig. 4 for each iteration of the IDA process. 

The final selection yielded 93 events in the data. The background was calculated using 
separate samples of simulated events from those used to tune the IDA parameters. It was 
found to consist of 18.8 events from 77 — > ccX, 3.9 events from 7*7 — > X and 1.5 events 
from e + e~ annihilation. 
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41 


50 
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data 




2696021 


16810 


244 


197 


93 



Table 1: Summary of the analysis. The first 5 rows show the cross section used for the 
simulation and the fraction (%) of each simulated subset at progressive stages of the 
analysis. The final row shows the number of events remaining in the data at each stage. 
The numeric column labels denote the analysis stages, they are (1) pre selection, (2) 
selection, (3) IDA iteration 1, (4) IDA iteration 2, (5) final cut on D 3 . 
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LO 




Figure 4: Distributions of the discriminant variable in data and Monte Carlo samples after 
each iteration of the IDA process. The points with error bars are the data, the dashed 
histogram is the simulated signal, the dash-dot histogram is the simulated background, 
and the solid histogram is the sum of signal and background simulations. Each distribution 
has been translated along the horizontal axis so that the selection cut is at zero. The 
signal simulation has been weighted according to the fit described in Section 8. 
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8 Efficiency Calculation 



The efficiency for signal events to pass the selection procedure was estimated using a 
separate sample of simulated signal events to that used to determine the IDA parameters. 
The efficiency is different for the direct and resolved components so in order to calculate 
the total efficiency, the relative size of the two components must be determined. This 
was found from the data by performing a fit to the x™ m distribution in the data after 
subtracting the background. The variable x™ m is defined as the smallest of x+ and x~ 
where 

7 (E tot ±pl ot ) ' 1 ; 

Here E l , p\ are the energy and longitudinal momentum of jet i, while E tot and p* ot are 
the energy and longitudinal momentum of the whole event. The sum is calculated for 
the highest and second highest energy jets in the event. The x^ variables are used in 
two-photon and photoproduction experiments to distinguish direct and resolved events. 
They represent the fraction of the incoming photon's four-momentum that has gone into 
the hard scattering process. For perfectly measured events the value of x 7 is identically 
1 for direct photons, and less than 1 for resolved photons, as in the latter case some of 
the photon's four momentum is taken away by the spectator jet. In practice direct events 
are characterised by having both x + and x~ larger than 0.75, while single resolved events 
tend to have either x + or x~ less than 0.75, and double resolved events have both values 
less than 0.75 [15]. In this analysis only direct and single resolved processes need be 
considered so x™ 11 can be used to separate them experimentally. 

The x™ 11 distribution is shown in Fig. 5 for data after subtracting background and 
the simulated direct and resolved components after fitting to the data. The result of 
the fit is that there are 30.8 ± 11.3 direct and 38.3 ± 11.9 resolved events in the data. 
The efficiencies are 0.022 for the direct term, and 0.016 for the resolved term. The mean 
efficiency is calculated to be 0.0184 ± 0.0009 where the error comes from the fit to the 
fraction of direct and resolved events. The trigger efficiency for events passing the final 
cut has been measured using independent triggers and found to be negligibly less than 
100%. 



9 Cross Section Calculation 

The total cross section is calculated as 

N — b 

a(e + e- -> e + e"bb X) = — — (2) 

where N is the number of events observed, b is the estimated background, e is the efficiency 
and C is the luminosity. With iV = 93, b = 24.2, e = 0.0184 and C = 698 pb" 1 the result 
is (j(e + e~ — > e + e~bbX) = (5.4 ± 0.8) pb where the error is statistical only. 
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Figure 5: The x™ m distribution. The points with error bars are the data after subtracting 
the background. The histograms show the distribution in the simulated direct and resolved 
signal after fitting to the data. 



10 Systematic Uncertainties 

10.1 Background Estimate 

The uncertainty on the background derives from the uncertainty on the cross section for 
each component. This is estimated to be 12.5% for 77 — > ccX [10], 40% for 7*7 — > X [16] 
and 3% for e + e" — > qq [17]. The resulting uncertainty on the background is 2.8 events. 

10.2 Monte Carlo Simulation 

To assess the sensitivity of the efficiency to the modelling of the physics channels a second 
sample of signal events was generated using the HERWIG program [18] (version 6.201). 
The difference in efficiency obtained using these events was 8.6%, and this has been used 
as a systematic error. The effect of varying the b-quark fragmentation function in the 
simulation was checked and found to be negligible. 

10.3 Wvis dependence 

Figure 3 shows some discrepancy in the W vis distribution at the highest values. To check 
whether this had any influence on the final result the analysis was repeated with the 
maximum accepted Wvis set to 30 GeV/c 2 . This resulted in the measured cross section 
dropping by 0.5 pb. This has been included as a conservative systematic error. 
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11 Cross Checks of the Analysis 



11.1 Stability with respect to the D3 cut 

An important check of the analysis comes from the dependence of the result on the D 3 cut. 
In Fig. 6 the cross section measurements obtained when varying the D 3 cut either side of 
the chosen value are plotted along with the uncorrelated errors of each point with respect 
to the point at the chosen cut. No systematic trend is observed. Similar studies on the 
D\ and D 2 cuts did not reveal significant effects so no additional systematic error was 
assigned. 



ALEPH 



7 



o 

CD 
CO 

CO 
CO 

o 
o 



6.5 



5.5 
5 
4.5 



3.5 



-0.6 -0.4 -0.2 



0.2 0.4 0.6 0.8 
Cut on0 3 



Figure 6: Stability of the cross section measurement with respect to changing the cut on 
D3. The total error is shown at the chosen cut value (D% = 0), while for the other points 
the uncertainties relate to the difference of each point with respect to the chosen cut. The 
bins are defined such that each contains 10 more data events than that to its right. 



11.2 Wvis distribution 

An independent test of the fit to direct and resolved components is given by the 
distribution of W v i S which is shown in Fig. 7. The direct and resolved components also 
have a significantly different distribution in this variable and together they give a good 
description of the data. 
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Figure 7: The distribution of W w - ls in selected 77 — > bb X data. Points with error bars 
are the data. The histograms show the distribution in the background, the direct and 
resolved signal and the sum of signal plus background. 



11.3 Semileptonic decays 

Approximately 20% of b quarks undergo semileptonic decays, in which an electron or a 
muon is generated from the W; therefore about 14 electrons and 14 muons are expected 
to be produced, on average, in the observed signal sample of 74 bb events, through direct 
semileptonic decays. Because of the large mass of the b quark, the leptons tend to be at 
higher transverse momentum relative to the accompanying jet than those from the decay of 
the lighter quarks. The production of leptons from semileptonic decays of the secondary 
charm in the b decay chain is also sizeable, but the selection efficiency is considerably 
smaller because of the softer momentum and transverse momentum spectra. All charged 
tracks with momentum greater than 2 GeV/c were considered as candidate electrons or 
muons. 

Muons were identified from the pattern of energy deposition left in the HCAL. In 
addition candidate muon tracks were required not be part of a track showing evidence of 
a kink in the TPC, to have at least 5 hits in the ITC, and have a dE/dx measurement in 
the TPC consistent with the expectation for a muon. 

Electrons were required to have a cluster in the ECAL whose transverse and 
longitudinal shape was consistent with that expected for an electromagnetic shower, and 
whose energy was consistent with the momentum measured in the TPC. In addition they 
were required to have at least one VDET hit and at least 3 ITC hits and not be from an 
identified converted photon. 

Simulation studies show that the majority of misidentified leptons or leptons not 
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originating from the decay of b hadrons are found at low transverse momentum relative to 
the nearest jet. Requiring the lepton transverse momentum to be greater than 1 GeV/c 
relative to the nearest jet leaves 0.1% of misidentified leptons and 2.5% from sources other 
than b hadron decays. 

Figure 8 shows the distribution of transverse momentum of electrons and muons with 
respect to the nearest jet in the final sample of events. If the lepton is included in the 
jet then its momentum has been subtracted from the jet before calculating the transverse 
momentum. The signal of 6 leptons is consistent with the prediction of 6 from the signal 
simulation plus 0.9 from the background. 
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Figure 8: The transverse momentum of electrons and muons with respect to the nearest 
jet in selected 77 — > bb X data. Points with error bars are the data. The histograms show 
the distribution in the background, the direct and resolved signal and the sum of signal 
plus background. 



12 Conclusions 

The cross section for the process e + e~ — > e + e~bb X has been measured to be 

a ( e + e - ^ e+e-bb X) = (5.4 ± 0.8 stat ± 0.8 syst ) pb 

which is consistent with the prediction of NLO QCD [1] of between 2.1 and 
4.5 pb but barely consistent with the result quoted by the L3 Collaboration [3], 
(12.8±1.7 stat ±2.3 syst )pb. 
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Appendix A Iterative Discriminant Analysis 

Discriminant analysis is a technique for classifying a set of observations into predefined 
classes. The purpose is to determine the class of an observation based on a set of input 
variables. The model is built based on a set of observations for which the classes are 
known. In standard discriminant analysis a set of linear functions of the variables, known 
as discriminant functions, are constructed, such that L = J2i=i n{^i x i) + c > where the 6's 
are discriminant coefficients, the Xi are the n input variables and c is a constant. In the 
method known as Iterative Discriminant Analysis [14] (IDA) the vector of input variables 
x is extended to include all their products x-iXj (i ^ j). In addition the process is repeated 
a number of times with a selection being applied at each iteration and a new discriminant 
calculated. In detail the IDA procedure works as follows: 

• For each event fill a vector y containing the n variables and (n 2 — n)/2 products of 
those variables. 

• Calculate the variance matrix V = V s + Vb, where V s is the variance matrix of the 
signal and V b is the variance matrix of the background; V s and V b are weighted so 
that they have equal importance. 

• Calculate A/i, the difference in the means of the signal and background, for each 
element of y. 

• Invert the variance matrix V and multiply by A/i, to obtain the vector of coefficients 
a = V^A/j. 

• For each event calculate D = y T ay. 

If necessary apply a selection to the events at some value of D and repeat the procedure 
as required. The IDA process does not prescribe how such a cut should be chosen, or how 
many iterations should be performed. 
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